When SMILES Smiles, Practicality Judgment and Yield Prediction of Chemical Reaction via Deep Chemical Language Processing

نویسندگان

چکیده

Simplified Molecular Input Line Entry System (SMILES) provides a text-based encoding method to describe the structure of chemical species and formulize general reactions. Considering that reactions have been represented in language form, we present symbol only model generally predict yield organic synthesis reaction without considering complex quantum physical modeling or chemistry knowledge. Our is first deep neural network application treats text segments as embedding representation most recent natural processing. Experimental results show our can effectively reactions, which achieves high accuracy 99.76% on practicality judgment Root Mean Square Error (RMSE) around 0.2 for prediction. work shows great potential automatic prediction under conditions further applications path with least cost.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SMIREP: Predicting Chemical Activity from SMILES

Most approaches to structure-activity-relationship (SAR) prediction proceed in two steps. In the first step, a typically large set of fingerprints, or fragments of interest, is constructed (either by hand or by some recent data mining techniques). In the second step, machine learning techniques are applied to obtain a predictive model. The result is often not only a highly accurate but also har...

متن کامل

Smiles when sharing

One of the proposed functions of human smiling is to advertise cooperative dispositions and thereby increase the likelihood that a social partner would invest resources in a relationship. In particular, smiles involving an emotional component would be honest signals of altruistic dispositions because they are not easy to produce voluntarily. In this study, 60 people were covertly filmed while i...

متن کامل

Smiles Counting the Smiles

This research summary, based on a review of the literature and a series of focus groups with NHS staff, shows that while it is difficult to measure how people feel about their work, there is much to suggest that morale and motivation of the NHS workforce are low. It identifies three key factors that affect morale and motivation: whether staff feel valued, their working environment, and resource...

متن کامل

SMILES. 2. Algorithm for generation of unique SMILES notation

(24) Ritter, G. L.; Isenhour, T. L. Minimal Spanning Tree Clustering of Gas Chromatographic Liquid Phases. Comput. Chem. 1977, 1, 145-153. Everitt, B. Cluster Analysis; Halsted: New York, 1974. Balaban, A. T. Chemical Graphs. XXXIV. Five New Topological Indices for the Branching of Tree-Like Graphs. Theor. Chim. Acra 1979, 53, 355-375. Balaban, A. T.; Motoc, I. Chemical Graphs. XXXVI. Correlati...

متن کامل

Frequent SMILES

Predictive graph mining approaches in chemical databases are extremely popular and effective. Most of these approaches first extract frequent sub-graphs and then use them as features to build predictive models. In the work presented here, the approach taken is similar. However, instead of frequent sub-graphs, frequent trees, based on SMILES strings are derived. For this, the SMILES strings of c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2021

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2021.3083838